CDS

Accession Number TCMCG075C25162
gbkey CDS
Protein Id XP_007011805.2
Location complement(join(1151037..1151059,1151412..1151524,1151908..1151979,1152279..1152329,1152766..1152853,1152970..1153093,1153339..1153411,1153556..1153616,1153832..1154019,1154145..1154194,1154389..1154452,1154708..1154832))
Gene LOC18587758
GeneID 18587758
Organism Theobroma cacao

Protein

Length 343aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007011743.2
Definition PREDICTED: hydroxyproline O-galactosyltransferase HPGT3 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K20854        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0006029        [VIEW IN EMBL-EBI]
GO:0006464        [VIEW IN EMBL-EBI]
GO:0006486        [VIEW IN EMBL-EBI]
GO:0006493        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008378        [VIEW IN EMBL-EBI]
GO:0009058        [VIEW IN EMBL-EBI]
GO:0009059        [VIEW IN EMBL-EBI]
GO:0009100        [VIEW IN EMBL-EBI]
GO:0009101        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0010384        [VIEW IN EMBL-EBI]
GO:0010404        [VIEW IN EMBL-EBI]
GO:0010405        [VIEW IN EMBL-EBI]
GO:0016740        [VIEW IN EMBL-EBI]
GO:0016757        [VIEW IN EMBL-EBI]
GO:0016758        [VIEW IN EMBL-EBI]
GO:0018193        [VIEW IN EMBL-EBI]
GO:0018208        [VIEW IN EMBL-EBI]
GO:0018258        [VIEW IN EMBL-EBI]
GO:0019538        [VIEW IN EMBL-EBI]
GO:0034645        [VIEW IN EMBL-EBI]
GO:0036211        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0043412        [VIEW IN EMBL-EBI]
GO:0043413        [VIEW IN EMBL-EBI]
GO:0044036        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044249        [VIEW IN EMBL-EBI]
GO:0044260        [VIEW IN EMBL-EBI]
GO:0044267        [VIEW IN EMBL-EBI]
GO:0070085        [VIEW IN EMBL-EBI]
GO:0071554        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:1901135        [VIEW IN EMBL-EBI]
GO:1901137        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]
GO:1901566        [VIEW IN EMBL-EBI]
GO:1901576        [VIEW IN EMBL-EBI]
GO:1990714        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGAGGGTTTACCTACGTCGACGAAAGCAGAGAGACGATGGAGATCGAAGAATCTACAGACATCCAAGCCTTCTTTGGTGATGGTCTTTTTCTCTTGCGTCGCTTGGCTCTACGTTGCTGGCCGGTTGTGGCAAGATGCAGAGAACAGAACATTGCTTGCTAATCTTCTTAAGAAGAACATTGAACAGAGACCAAAGGTTCTTACGGTCGAAGATAAGCTAATGGTCCTAGGATGCAAGGATCTAGAGAGGAGGATTGTAGAAGCAGAGATGGATTTGACGTTAGCTAAGAGTCAAGGATACCTGAAGCACCAGTTGCGACAAAGTGGTTCTTCAGATCAGAAGCTTCTTGCAGTTATTGGAGTCTATACTGGATTTGGTAGTCACTTGAAACGAATTACATTTAGAGGATCTTGGATGCCTAGAGGTGATGCATTAAAAAAGCTGGAGGAAAGAGGGGTTGTGATACGATTGGTGATTGGTCGGAGTGCTAATCGAGGTGATAGCTTGGATCGCAATATTGATGAGGAAAACCGTAAGACAAAGGATTTCTTTATTCTTGATGGTCATGAGGAGGCGCAAGAGGAGCTTCCTAAGAAAGCAAAATTTTTCTTCACTGCTGCAGTTCAAAATTGGGATGCAGAATTTTACGTCAAAGTTGATGATAATATTGACATTGGCCTTGAGGGATTGATTGGACTTCTTGAACAACGGCGTGGCCAAGATAGTGCTTATATTGGATGCATGAAGTCAGGAGAAGTGGTTGCTGAAGAGGGAAGGCCTTGGTTTGAACCAGAATGGTGGAAGTTTGGGGATGAGAAATCGTATTTTCGCCATGCCTCTGGTTCACTTCTTATACTCTCCAAAAATCTTGCTCAGTACATCAACGTAAACAGTGCATCTTTGAAGACTTATGCGCATGATGATATATCGGTGGGGTCCTGGATGATGGGTGTCCAGGCAACTTACATAGATGACAATCGTCTTTGCTGCAGTAGCATTAGACAAGATAAGGTGTGTTCCGTGGCTTGA
Protein:  
MEGLPTSTKAERRWRSKNLQTSKPSLVMVFFSCVAWLYVAGRLWQDAENRTLLANLLKKNIEQRPKVLTVEDKLMVLGCKDLERRIVEAEMDLTLAKSQGYLKHQLRQSGSSDQKLLAVIGVYTGFGSHLKRITFRGSWMPRGDALKKLEERGVVIRLVIGRSANRGDSLDRNIDEENRKTKDFFILDGHEEAQEELPKKAKFFFTAAVQNWDAEFYVKVDDNIDIGLEGLIGLLEQRRGQDSAYIGCMKSGEVVAEEGRPWFEPEWWKFGDEKSYFRHASGSLLILSKNLAQYINVNSASLKTYAHDDISVGSWMMGVQATYIDDNRLCCSSIRQDKVCSVA